Building a Pilot Software Quality-in-Use Benchmark Dataset

نویسندگان

  • Issa Atoum
  • Bong Chih How
  • Narayanan Kulathuramaiyer
چکیده

Prepared domain specific datasets plays an important role to supervised learning approaches. In this article a new sentence dataset for software qualityin-use is proposed. Three experts were chosen to annotate the data using a proposed annotation scheme. Then the data were reconciled in a (no match eliminate) process to reduce bias. The Kappa, statistics revealed an acceptable level of agreement; moderate to substantial agreement between the experts. The built data can be used to evaluate software quality-in-use models in sentiment analysis models. Moreover, the annotation scheme can be used to extend the current dataset. Keywords—Quality in use, Benchmark dataset, software quality, sentiment analysis

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hybrid Framework for Building an Efficient Incremental Intrusion Detection System

In this paper, a boosting-based incremental hybrid intrusion detection system is introduced. This system combines incremental misuse detection and incremental anomaly detection. We use boosting ensemble of weak classifiers to implement misuse intrusion detection system. It can identify new classes types of intrusions that do not exist in the training dataset for incremental misuse detection. As...

متن کامل

Assessment of the completeness of Volunteered Geographic Information focusing on building blocks data (Case Study: Tehran metropolis)

Open Street Map (OSM) is currently the largest collection of volunteered geographic data, widely used in many projects as an alternative to or integrated with authoritative data. However, the quality of these data has been one of the obstacles to the widely use of it. In this article, from among the elements related to the quality of volunteered geographic data, we have tried to examine the com...

متن کامل

daQ, an Ontology for Dataset Quality Information

Data quality is commonly defined as fitness for use. The problem of identifying the quality of data is faced by many data consumers. To make the task of finding good quality datasets more efficient, we introduce the Dataset Quality Ontology (daQ). The daQ is a lightweight, extensible vocabulary for attaching the results of quality benchmarking of a linked open dataset to that dataset. We discus...

متن کامل

Building Dataset on User Study and Webgazer Analysis

I’ve helped fixed bugs and loggings in WebgazerPluPlus, and starting in the late March, Alexandra have conducted over 60 user studies on WebgazerPlusPlus. In the meantime, I’ve written codes to automatically process user study data, converting to a standard dataset. The dataset will set a benchmark for WebgazerPlusPlus, and our goal is to release the dataset to the public to encourage further i...

متن کامل

Employing Nonlinear Response History Analysis of ASCE 7-16 on a Benchmark Tall Building

ASCE 7-16 has provided a comprehensive platform for the performance-based design of tall buildings. The core of the procedure is based on nonlinear response history analysis of the structure subjected to recorded or simulated ground motions. This study investigates consistency in the ASCE 7-16 requirements regarding the use of different types of ground motions. For this purpose performance of a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1509.05736  شماره 

صفحات  -

تاریخ انتشار 2015